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The Pichia pastorls system for expression of hetero- 
logous recombinant proteins is being used increasingly 
because of the large yields of properly folded proteins 
that result and the ease of scaling preparations into 
large-blomass fermentors. Another advantage of this 
system centres on the type of glycosylation that results, 
generally yielding protein-bound oligosaccharides that 
are of much shorter chain length than found in Sacchare- 
myces cerevWoe. This review is a summary of the 
current state of knowledge of glycosylation of proteins 
in this methylotrophic yeast. 



Introduction 

The Pichia pastorls expression system has been used ex- 
tensively over the past decade for generation of recombinant 
proteins, and the nature of the processing that occurs In 
yeast has become a topic of interest Yeast can carry out 
glycosylation of the amide nitrogen of asparagine residues in 
a protein when found in the consensus sequence Asn-Xaa- 
Thr/Ser, providing N-linked glycosylation. Also, glycosyla- 
tion of hydroxy groups of threonine and/or serine residues 
in proteins occurs in yeast cells, yielding O-llnked type of 
glycosyiation [I]. As delineated mainly in Soochoromyces 
cerev/s/ae, protein glycosylation Involves the following sum- 
marized events. N-linked oligosaccharides originate from an 
oligosaccharide (Glc,Man,GlcNACj) that is assembled on 
dolichol (pyrophosphate) in the endoplasmic reticulum (ER), 
and that is transferred to the appropriate Asn of the nascent 
protein in a co-translational event [2,3]. This is a common 
eukaryotic-cell-type glycosylation pathway for N-linked 
glycoproteins. However, differences In subsequent pro- 
cessing of the newly glycosylated protein occur in yeast as 
compared with higher eukaryotic cells (plants, insects and 
higher animals). In most yeasts that have been studied, the 
three glucose residues and one specific a-l,2-linked mann- 
ose residue are removed by specific glycosidases in the ER 
[4], giving a protein N-linked Man,GlcNACj core structure 
that Is -further processed in the Golgi complex [5,6]. This 
involves addition of a mannose residue that is linked a- 1 ,6 to 
the a-l,3-linked mannose residue in the Mana-l,3Man/J- 



1 ,4GlcNAc inner-core sequence. The 1 ,6-linked residue can 
then be elongated to an a-l,6-linked backbone (50-100 
residues) that can be branched with a-l,2-linked di- or 
tri-saccharides of mannose that in turn can be capped with 
oc- 

1,3-linked mannose units. An additional processing event 
that occurs in most yeasts is the addition of phosphate, in 
the form of mannose I -phosphate, to specific a- 1,2- or 
a-l,6-llnked mannose residues of the core or the side chain 
of N-llnkad oligosaccharides, forming the acidic phospho- 
diester component found in many yeast glycoproteins [7]. 
Complex-type oligosaccharides containing sialic acid, gala- 
ctose, fucose and N-acetylgalactosamine are not found on S. 
cereWs/oe-derlved, and most other yeast-derlved, glyco- 
proteins. Therefore, the assembly and processing of only 
the high-mannose type of N-llnked oligosaccharides on the 
expressed recombinant protein mus: be taken into account 
when considering the structure and the subsequent use 
(function) of the glycoprotein. Whereas cell-wall mannans 
from some strains of P. pastoris have been found to contain 
/2-1,2-linked mannose residues, such as present in the 
structure Man(/M,2)Man(/?-l,2)Man(a-l,2)Man, ^-linked 
mannose residues have not been reported in recombinant 
glycoproteins from P. pastoris [8,9]. In addition to the N- 
linked oligosaccharides, the presence of O-linked oligo- 
saccharides of mannose is common in most yeasts [10]. The 
biosynthetic process is unique compared with animals, in 
that the mannose residue that is «-glycosidically linked to 
the hydroxy group of a serine or threonine residue of the 
protein is derived in the ER from a dolichol phosphate 
carrier, dolichyl phosphate mannose, rather than from a 
sugar nucleotide in the Golgi complex. These short oligo- 
saccharides can also be phosphorylated and thus contribute 
to die acidity and perhaps other functions of the glycoprotein 

The status of glycosylation of recombinant proteins 
from P. pastoris has been presented in earlier reviews of 1 992 



Abbreviation i*ed; ER. endoplasmic reticulum; endo H, endo-£N- 
acetvlglucosiminidase H; PNGase F, peptide N-glycohydrolase; DE-TOF- 
MALD) MS, delayed-extraction time-of-ilight matrix-assisted laser desorption 
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[I I] and 1993 [12], and briefly In a recent review [I]. The 
purpose of this current review Is to summarize the 
Information that has been acquired over the past 10-12 
years on the structures of oligosaccharides of recombinant 
proteins from P. pastoris. The large body of information 
available from previous studies with S. cerews/ae has greatly 
aided In understanding and elucidating these structures. 



Gel-shift assays 

Making use of the knowledge that only hfgh-mannose-type 
oligosaccharides are found on yeast glycoproteins has 
afforded a relatively simple experimental method for de- 
termining whether N-glycosylatlon has occurred on the 
recombinant protein. The availability of enzymes that 
recognize the high-mannose core structure and catalyse 
hydrolysis of the GlcNAc£-l,4GlcNAc linkage In the core, 
resulting in release of the oligosaccharide, generally results in 
sufficient decrease In apparent mass that can be detected 
using SDS/PAGE. Endo-jff-N-acetylglucosamlnidase H (endo 
H) Is one such enzyme that has been used extensively for this 
purpose. The other type of enzyme utilized for release of the 
N-linked oligosaccharide is not specific for high-mannose or 
complex-type structures, and catalyses hydrolysis of the 
GlcNAc^-Asn linkage. This enzyme Is peptide-N-glyco- 
hydrolase F (glycopeptidase F or PNGase F), and it has been 

Tabic I Endoglycosidase/ge! shifts as evidence for glycosylate 



Endo F, endo-^-N-acetylglucosaminidase F. 



Change in molecular 

Protein mass (kDa) Glycosidase Reference 


S. censvte/oe invertase 


100-140 to SB 


EndoH 


[13] 


HIV envelope protein 


200 to 59 


EndoH 


[36] 


Boophflus microplus Bm86 antigen 


90-100 to 70 


Endo H 


P7] 
P8] 


BocflJus amylollquefoaens a-amylase 


Smear to 60 


EndoH 


Bovine enterokinase catalytic subunit 


46 to 33 


EndoH 


[39] 


Rabbit anglotensin-convertirig enzyme 


90 to 70 


PNGase F 


[40] 


5. cerevisioe a-U-mannosyltransferase (Ktrlp) soluble domain 


Reduced by ZS 




[41] 


Bovine herpes virus type-l glycoprotein D 


68 to 39 


EndoH 


[42] 


Human prolyl 4-hydroxylase o-subunit 


Three bands to one 


PNGase F 


[43] 


Mouse gelatinase B 


110 to 95 


PNGase F 


[44] 


Human angfostatin (Pg KI-4) 


51 to 49 




[45] 


Bovine opsin 


Reduced 


Endo H or PNGase F 1 


[46] 


Human costimulatory B7-I and B7-2, and 87 receptor CTUA-4 


Reduced 


PNGase F 


[47] 


Golgi sialoglycoprotein MGI 60 


I40to 130 


Endo H or PNGase F 


[48] 


Human a,-antitrypsin 


52 to 46 


EndoH 


[49] 


Plasminogen activator inhibitor- 1 


84-100 to 50 


PNGase F 


[50] 


Human placental alkaline phosphatase 


62 and 65 to 58 


EndoH 


[51] 




61 to 55 and 38 


EndoH or PNGase F 


[52] 


Bla g 4 cockroach allergen 


23to IBand 19 


Endo F/PNGase F 


[53] 


Human tissue factor 


37-^5 to 33 


EndoH 


[54] 


Human CDS- 1 10 


22.9 to 14.8 


EndoH 


[5536] 


Trypanosoma auri acid a-mannosidase 


65 to 57 and 46 


Endo Hot PNGase F 


[57] 


Porcine follicle-stimulating hormone 


23 to 15,5 and 24 to 18 


Endo F/PNGase F or endoH 


[58] 


Carcinoembryonic antigen domains N and A3 fusion 


37 to 24 


PNGase F 


[59] 


Type-ll activin receptor extracellular domain 


30 to 25 


EndoH 


[60] 


Bovine herpes virus type-l glycoprotein bovine interleukin-6 chimaera 


100 to 59.5 


EndoH 


[61] 
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utilized extensively for demonstrating shifts in SDS/PAGE 
mobilities as an indication of N-glycosylation. Reports where 
N-glycosylation has been demonstrated only by use of either 
endo H or PNGase F in a gel-shift assay are listed In Table I . 
The observed differences in apparent masses before and 
after endo-enzyme treatment are shown. 



N-linked oligosaccharides from 
P. pastoris 

Much of the current knowledge of the structures of the 
asparagine-llnked oligosaccharides that are assembled on 
recombinant proteins from P. pastoris has been obtained 
with recombinant S. cerevisfae invertase. Expression in P. 
pastoris of the SUC2 gene for this protein gave a secreted 
glycosylated Invertase that was shown to contain less 
carbohydrate, due to shorter asparagine-iinked oligo- 
saccharides, than the invertase secreted by S. cerevislae [13]. 
The majority (85%) of the endo H-released oligosacchar- 
ides from the P. postoris-expressed invertase were of 
Man^GlcNAc sizes, with the remainder being much larger 
(Man >M GlcNAc size) but still shorter than the long oligo- 
saccharide chains (Man >S0 GlcNAc) found on the native S. 
cerevis/ae invertase [14]. A detailed 'H-NMR study of the 
structure of the dlfferent-chain-length oligosaccharides that 
were released by endo H from the heterologous Invertase 
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and fractionated on BioGel P-4 was conducted [IS]. The 
major conclusions were: the Man,GlcNAc has the normal 
yeast processing structure that iacks the a-l,2-Man on the 
middle arm of the Man,GlcNAc In the dolichol-linked 
precursor oligosaccharide; 95% of the Man,GlcNAc has an 
a- 1 ,6-linked Man on the a- 1 ,3-Man arm of the Man„GlcNAc; 
80% of Man l0 GlcNAc has an cc-l,2-Man on the new a- 1, 6- 
Man of Man,GlcNAc, 12% has an <z-l,2-Man on other 
a- 1 ,2-Man termini, and 8% has an a- 1 ,6-Man on the new a- 
1,6-Man of Man,GlcNAc; the Man„GlcNAc has four struc- 
tural isomers, the major one (82%) being the major 
Man, 0 GlcNAc with an additional a-l,2-Man on one of the 
three a-l,2-Man termini, and the other three structures 
having an a- 1 ,2-Man on either one of the two new a- 1 ,6-Man 
residues or on the a- 1 ,2-Man terminus of the a- 1 ,6-Man arm 
of the Man,„GlcNAc. Significantly, no new a- 1 ,3-Man linkages 
were observed in these structures, Implying that ft pastoris 
does not have, or does not utilize, a-l,3-mannosy! trans- 
ferases for the processing of glycoproteins. Assuming that 
the dolichol-linked oligosaccharide in P. pastoris has the same 
Man,GlcNAc structure as found in other eukaryotic cells, 
only the specific a- 1 ,2-mannosidase (for generating the yeast 
protein-linked Man 8 GlcNAci), a- 1 ,2-mannosyl transferase^) 
and a-l,6-mannosyl transferase^) would be required for 
processing of newly synthesized glycoproteins. The lack of a- 
1,3-Man linkages In the recombinant invertase oligosac- 
charides was later confirmed by studies showing that 
membranes from P. pastoris cells did not exhibit a-1,3- 
mannosyl transferase activity with GDP-Man as mannosyl 
donor and appropriate mannose-containlng oligosaccharides 
as acceptors, whereas a- 1,6- and a- 1 ,2-mannosyl transferase 
activities were present [16]. 

A comparative study of N-linked glycosylation on six 
different recombinant proteins secreted from P. pastoris 
revealed some differences in chain lengths of the PNGase 
F-released oligosaccharides [17]. The relatively short 
Man„GlcNAc and Man,GlcNAc oligosaccharides were 
most commonly observed. Shorter oligosaccharides 
(Man,GlcNAc) were abundant only on yeast (S. cerevfciae) 
Invertase, and longer oligosaccharides (Man| 0 GlcNAc up to 
Man^GlcNAc) were abundant only on fungal (Mucor pusUlus) 
aspartic protease and bovine enterokinase catalytic subunit, 
The other three proteins, fungal (Penkifflum minioluteum) 
dextranase, bacterial (bacillus licheniformis) a-amylase and 
tick (Boophilus microphis) gut antigen, contained predom- 
inantly Man,GlcNAc and Man,GlcNAc oligosaccharides. In 
this latter study, little differences in the size of the 
oligosaccharides could be correlated to whether the re- 
combinant protein was prokaryotic or eukaryotic, com- 
position of the growth medium or culturlng in shake flask or 
fermentor. As the same Integrative expression vector 
containing the yeast invertase signal sequence and the 
glyceraldehyde-3-phosphate dehydrogenase terminator se- 



quence was used for expression of all proteins but the 
enterokinase, differences In transcriptional regulation would 
be considered to be minimal. However, the data suggest an 
inverse correlation of the oligosaccharide size with the 
number of potential N-linked sites and the expression level. 
It is suggested that the flow of newly synthesized glyco- 
protein through the secretory pathway might be so great 
as to -not allow sufficient or proper processing of the 
oligosaccharide chains. In this case, use of weaker promoters 
other than the strong AOXI promoter might allow en- 
hanced glycosylation processing. 

The N-glycosylation by P. pastoris of Asn-Xaa-Thr/Ser 
sites in prokaryotic proteins that are normally not glyco- 
sylated in their native hosts has been studied and was clearly 
demonstrated with the a-amylase of Bacillus licheniformis 
[18]. The protein contains six N-linked sites that could 
potentially be glycosylated. A decrease In mass from 
approximately 90 to 60 kDa after endo H treatment of the 
expressed protein indicated that substantial N-glycosylatlon 
had occurred. Location of the glycosylated sites in the 
recombinant protein was based on trypsin digestion before 
and after treatment with endo H, and subsequent frac- 
tionation of the peptides by HPLC. Monitoring of the 
peptides with concanavalin A and wheatgerm agglutinin 
lectins (as their horseradish peroxidase conjugates), along 
with peptide sequencing and MS, allowed assignment of 
peptides that did contain carbohydrate as the original high- 
mannose oligosaccharide or as the residual GlcNAc after 
endo H treatment Although the structures of the oligo- 
saccharides were not investigated, five of the six potential N-. 
sites were ass igned as being glycosylated by these techniques. 

Glycosylation of other recombinant fungal enzymes 
from P. pastoris has been reported. An Aspergillus fumigatus 
catalase gene has been expressed in P. pastoris [19], The 
protein has four potential N-linked sites, and glycosylation of 
the recombinant protein was indicated by reaction with 
concanavalin A-peroxidase conjugate on a gel blot No 
other structural information on the oligosaccharides was 
presented. Expression of glucoamylase of Aspergillus awamorii 
yielded a recombinant protein that was analysed for neutral 
sugar with the phenol/sulphuric acid colorimetiic pro- 
cedure, and for mass by SDS/PAGE and MS [20]. The results 
were consistent with the recombinant enzyme containing 96 
neutral hexose molecules. The structures of the carbo- 
hydrate units were not further investigated. 



Modification of the Asn-linked 
glycosylation pathway in yeast 

Attempts to genetically modify glycosylation processing in 
P. pastoris were carried out [21]. Recombinant strains of P. 
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pastoris containing an integrated cDNA for either secreted 
haemagglutlnln or Influenza neuraminidase were further 
transformed with a fuli-length cDNA for a- 1 ,2-mannosidase 
from Tr/chodermo reesei or with a fusion cDNA of the 
catalytic domain of the T. reese/ a- 1 ,2-mannosidase and 
the ER retention signal for the S. cerevisiae MNSI protein 
(specific ER-processing a- 1 ,2-mannosidase). Oligosaccha- 
rides from neuraminidase secreted by cells containing the 
heterologous a- 1 ,2-mannosldase were shown to be partially 
reduced in size from Man <H GlcNAc observed in control 
cells to Man^GlcNAc, implying that some additional trim- 
ming of mannose residues was occurring with the flt-1,2- 
mannosldase. As the T. reesei mannosidase was observed to 
be secreted from the P. pastoris cells, its catalytic domain was 
anchored to the ER by using the fusion cDNA containing the 
S. cereWs/ae retention signal. Neuraminidase secreted by 
these P. pastoris cells did not contain the shorter Man <[4 G- 
IcNAc oligosaccharides, and secreted hemagglutinin oligo- 
saccharides contained additional short Man^GlcNAc and 
longer hyperglycosylated components. It was suggested that 
the presence of the heterologous a- 1 ,2-mannosidase al- 
lowed formation of better substrates for mannosyltrans- 
ferases, and thus the formation of larger mannosylated 
oligosaccharides. Further investigations of this type could 
result in expression of recombinant proteins that would not 
be compromised In structure or function by the large 
mannosylated side chains. 



Identification of sites of N-linked 
glycosylation in glycoproteins 

The location In the polypeptide chain of the Asn-Xaa- 
Thr/Ser glycosylation site, and the presence of tandem or 
overlapping sites, were shown to have significant effects on 
the size of the oligosaccharide chains In recombinant single- 
chain Fv antigen-binding protein (scFv) [22]. Having a single 
site near (1-5 residues) the C-terminal end of the protein 
resulted In only core (Man,_ M GlcNAc) glycosylation, with 
high efficiency (95%) for the five-residue-spaced site. The 
presence of tandem sites near the C-termlnal end induced 
hyperglycosylation (increase in mass by 9-15 kDa), with 
three tandem sites being completely hyperglycosylated. 
Interestingly, the presence of a single two-site overlapping 
sequence (NNTT) adjacent to the C-terminal serine residue 
resuited In core glycosylation (minor) and hyperglycosylation 
(major). It was therefore suggested that the sequence NNTT 
deiineates a minimal sequence for one form of hyperglyco- 
sylation. But it Is aiso cautioned that In these experiments 
the N-glycosylation sites are engineered to be very near the 
C-terminus of the recombinant protein, a glycosylation site 
that Is found rarely in native proteins. 



Characterization of glycosylated 
recombinant proteins from P. pastoris 

Use of expression vectors for P. pastoris that contain the 
yeast a-mating-factor signal sequence (such as pPIC9) can 
give results on glycosylation that must be interpreted 
carefully because of the potential N-glycosylatlon sites in this 
secretion signal sequence. An example of this is the report 
on expression of human procarboxypeptidase A2 (pro- 
CPA2) in P. pastoris [23]. The pPIC9 vector was used to 
construct the expression vector for secretion of the 
recombinant protein. On SDS/PAGE, a major 45-kDa 
secretion product corresponding to proCPA2 was obtained, 
and also a smear of hlgher-molecular-mass immunopositlve 
material that was reduced to 50-60 kDa mobility with 
PNGase F treatment. This latter protein corresponded to 
<xMF-proCPA2, and was Interpreted to , result from In- 
complete in vivo processing of the glycosylated construct by 
the P. pastoris KEX2 protease. N-termlnal amino acid 
sequencing of recombinant proteins is generally required 
and utilized to ascertain that proper protease processing has 
occurred. 

Proteins that are expressed in glycosylated forms by P. 
pastoris are suitable for crystallization and subsequent 
structural analysis. The glycosylated oc E C protein domain of 
human fibrinogen-420 was expressed in P. pastoris and 
shown to contain oligosaccharides released by PNGase F of 
sizes Man^GlcNAc, and minor amounts of phosphorylated 
components (ManjPGIcNAcJ [24]. A segment of the ex- 
pressed domain containing residues 647-847, including the 
N-glycosylation site at Asn" 7 , was crystallized for structural 
analysis. Electron density at Asn" 7 in the resulting electron- 
density map was attributable to the N-linked GlcNAc of the 
carbohydrate cluster [25]. 

A more detailed account of the ability to crystallize P. 
pastorfs-expressed and -glycosylated proteins has been 
reported for human pancreatic a-amyfase [26]. It was shown 
by electrospray ionization MS that the protein contained one 
N-linked oligosaccharide of sizes Mane., ,GlcNAc z , and in this 
glycosylated form was not crystalllzable. However, after 
treatment with endo-jS-N-acetylglucosamlnldase F, the pro- 
tein was crystallized and the electron density at Asrf"' shown 
to fit a GlcNAc residue. Important and Interesting results of 
this glycosylatlon-site assignment were thatAsn 441 is not part 
of an Asx or /?-turn that favour glycosylation, and that the 
cysteine residue in the Asn"'-Cys-Thr consensus sequence 
Is In dlsulphide linkage with Cys' 11 ' In the native protein. Few 
other proteins have been found to have this structure 
around a glycosylation site. 

Structures of the various glycoforms of the recom- 
binant krlngle 2 domain of tissue-type activator (r-KJ,^ [27] 
have been analysed In our laboratories [28,29]. Release of 
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Residues shown in regular type are derived from the GICjMan,GlcNAc, 
oligosaccharide that is assembled through the dollchol pathway, transferred to 
the protein, and further processed by glucosidases I and II, and an ER-localized 
a-l,2-mannosidase. The resulting Man a GlcNACj core structure has been 
retained in alt of the proposed structures. The other mannose residues present 
in the oligosaccharides (italicized) are derived from later processing reactions. 
Glycosidic linkages from the Man-GlcNAc-QcNAc-R component of the core 
structure are displayed as -(a-l,2)/(a-l,3) and\(a-l,6) linkages. M, mannose; 
GN, GlcNAc This Figure is reproduced from [28] © 1997 Portland Press on 
behalf oflUBMa 



the oligosaccharides from the protein by PNGase F or by 
hydrazlnolysis gave identical profiles on subsequent chroma- 
tography of the fluorescendy labelled products, indicating 
that only N-linked saccharides were present Fractionation 
by gel filtration on BloGel P-4 or by HPLC on amino-silica 
revealed a series of oligosaccharides that corresponded to 
Mang.nGlcNAcj, with the major components (75%) being 
MatVijGkNAcj. Structures of the major Individual oligo- 
saccharides were analysed by using a combination of a- 
mannosldases (non-specific jack bean, 1 ,2-specific AspergiWus 
sa/tot and 1,6-specific Xanthamonas manihotis), gel filtration 
and delayed-extraction time-of-flight matrix-assisted laser 
desorption ionization MS (DE-TOF-MALDI MS). Results of 
these analyses could be accounted for by a Man,GlcNAc 1 
core structure (yeast type) having an extension on the core 
a-l,3-Man arm of up to four a-l,6-Man units and up to 
two a-l,2-Man units on either the Man,GlcNAc2 core or 
the a-l,6-linked side-arm extension. No a-l,3-Man glyco- 
sidic linkages were observed (or required to account for the 
experimental data). The types of structures that can be 
observed are summarized in Rgure I . All of these results are 
in agreement with earlier structural and enzymic studies 
pertaining to P. pastoris, and with a large body of knowledge 
pertaining to protein glycosylation in S. cereWsiae. 

O-glycosylation of proteins from 
P. pastoris 

Evidence for O-glycosylation of recombinant proteins ex- 
pressed in P. pastoris Is limited. An early study involving miid 
alkaiine treatment of glycoproteins of glycoproteins from 



the periplasmic space and from secretions of cells expressing 
heterologous invertase indicated by gel filtration that O- 
linked oligosaccharides were present but were not major 
components as. compared with the N-linked oligosacchar- 
ides [14]. For recombinant human lysosomal a-mannosidase, 
some carbohydrate was resistant to removal by endo H and 
N-glycanase, as indicated by retention of staining with 
Galanthus nivalis lectin ; this was attributed to the presence of 
O-lfnked mannose structures [30]. A human single-chain 
urokfnase-type plasminogen activator protein without an N- 
glycosylation site (Asn -► Gin mutation) was expressed and 
secreted, and was shown to stain on a gel blot for carbo- 
hydrate [31]. It was therefore concluded that the 
carbohydrate was O-linked to this recombinant protein. 

Two studies having direct structural evidence for O- 
mannosylation of expressed proteins in P. pastoris have since 
appeared. One was conducted with recombinant forms of 
mouse gelatlnase B [32]. Several glycosylated truncated 
variants of this protein were expressed, ranging in size from 
H 2 to 60 kDa on gel electrophoresis. Variants (75 and 
60 kDa) lacking N-glycosylatlon sites were still stained with 
G. nivalis lectin, suggesting that O-linked saccharides were 
present. Release of bound saccharides by hydrazinolysls 
yielded oligosaccharides ranging In size by HPLC from mono- 
to pentasaccharides, and which were completely degraded to 
mannose by jack-bean a-mannosidase. These results are 
consistent with O-linked saccharides of mannose being 
assembled on serine or threonine residues of the newly 
synthesized recombinant protein. The second study of O- 
glycosylation in P. pastoris was carried out in our laboratory 
with recombinant kringle 1-4 domain of human plasminogen 
[33]. Release of saccharides from the expressed protein was 
accomplished by hydrazinolyis and with alkaline ^-elim- 
ination. Both procedures yielded oligosaccharides up to 
pentamer in size (gel filtration) that were completely 
degraded to monomer by jack-bean a-mannosidase. Treat- 
ment with a specific a- 1 ,2-mannosidase degraded essentially 
all of the saccharides to monomer, indicating that no a- 1 ,3- 
linked mannose units were present in the O-llnked oligo- 
saccharides. 



Phosphorylation of oligosaccharides in 
P. pastoris 



The presence of phosphate In S. cerevis/oe invertase that was 
expressed and secreted by P. pastoris was demonstrated In 
two ways [14]. Retention of some of the endo H-released 
oligosaccharides on anion-exchange resin (quaternary 
aminoethyl-Sephadex) was attributed to negatively charged 
phosphate groups. Also, metabolic labelling of cell cultures 
with W P and subsequent isolation of the invertase-associated 
oligosaccharides revealed the presence of radioactivity in 
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the hyperglycosylated Man J0 GlcNAc fraction and In the 
Man| S . K GlcNAc fraction. Alkaline phosphatase treatment of 
these oligosaccharides did not release phosphate. Thus it 
was concluded that the phosphate was not mono-esterified. 
Similar techniques and results were reported for the 
recombinant head domain of Influenza neuraminidase [34]. 
In addition, acid hydrolysis of the neuraminidase-llnked 
oligosaccharides released small amounts of glucose phos- 
phate and mannose phosphate, as demonstrated by high- 
performance anion-exchange chromatography [21]. For 
recombinant chicken a-N-acetylgalaetosaminidase, it was 
suggested from MS data of tryptic glycopeptides that one 
phosphate group was present on some of the 
Man^ l4 GlcNACj-bound structures [35]. But for the o^C 
domain of human fibrinogen 420, it was suggested from 
mass-spectral analyls that the one N-glycosylation site 
contained approximately one-third of phosphorylated 
ManjGlcNACj, the remainder being Man^jGlcNAcj (non- 
phosphorylated i) [24]. A comparative study of the N-llnked 
oligosaccharide structures on six hydrolytic-type enzymes 
that were expressed in P. pastoris revealed some unique 
features [17]. In particular, on only one (aspartic protease) of 
the expressed proteins were phosphorylated oligosacchar- 
ides detected. Changes In HPLC and FACE mobilities after 
mild acid hydrolysis and alkaline phosphatase treatment 
suggested that the phosphate was present in diester form 
between mannose residues on Man l0 .„GlcNAc a . 

We have analysed the acidic fraction of oligosaccharides 
that are assembled on the recombinant kringle 2 domain of 
human tissue-type plasminogen activator, when expressed in 
P. pastoris [29]. All N-llnked oligosaccharides were released 
with PNGase F and the pool of fluorescently labelled 
oligosaccharides fractionated on an amlno-silica column into 
a family of neutral and a family of charged components. 
Analysis of the charged components by DE-TOF-MALDI MS 
revealed masses that corresponded to individual major 
components from Man l(w< GlcNAc 1 , plus 80 mass units, 
suggestive of a phosphate group being present Treatment of 
these components with alkaline phosphatase did not alter 
the masses, Indicating that the phosphomonoester linkage 
was not present Treatment with mild acid hydrolysis 
followed by alkaline phosphatase reduced the masses by 1 32 
mass units for a hexose and 80 mass units for a phosphate 
group, as expected for a phosphodiester linkage between an 
anomeric hydroxy and a primary (or secondary) hydroxy of 
two mannose units. The pool of observed charged oligo- 
saccharides would then correspond to the Man WJ GlcNAcj 
neutral oligosaccharides previously characterized, but each 
with an additional esterified mannose-phosphate group. The 
site of linkage of this group to the oligosaccharide has not 
been characterized, but may be Identical to that found In 
other yeasts. The sites of phosphorylation that have been 
identified in S. cerevis/oe mannoproteins are on both the 



Man,GlcNAc 2 core and outer side chain of N-linked olig- 
osaccharides, and on the short O-linked oligosaccharides 
[7]. Mannose I -phosphate is transferred to carbon 6 of the 
central mannose residue In an acceptor sequence of Man(a- 
l,2)Man(«-l,2) or (a-l,6)Man, and such transfer appears to 
compete with addition of an a- 1,3-Man residue to the non- 
reducing Mana-1,2 residue of the sequence. As a-1,3- 
mannosyltransferase activity is lacking in P. pastoris, it Is 
possible that relatively large quantities of phosphorylated 
oligosaccharides, perhaps both N- and O-linked, would be 
found on native and recombinant glycoproteins. 



Conclusions 

Heterologous proteins that are expressed by P. pastoris cells 
can be glycosylated on the Asn component of the Asn-Xaa- 
Thr/Ser consensus sequence and on Ser or Thr hydroxy 
groups to provide N-llnked and O-linked saccharides, 
respectively. Studies on the structures of N-linked oligo- 
saccharides are consistent with earlier conclusions that a 
common yeast-type core structure of MangGlcNAcj, prob- 
ably originating from a dollchol-linked GICjMan^GlcNACj, Is 
elongated on the 1,3 arm with a chain of a-l,6-linked 
mannose units. These, in turn, can be branched with a- 1,2- 
linked mannose units. The most common N-llnked oligo- 
saccharide that is assembled on proteins is Man^GlcNAcj, 
implying that short a- 1,6 extensions occur, as compared 
with long (>S0) chains In proteins from S. eerewiioe. 
Consistently, a- 1 ,3-linked mannose units are found to be 
absent In P. pastor/s-derived proteins. Phosphorylated oligo- 
saccharides are present probably as Man-l-P-6-Man-R 
phosphodlesters, but exact sites of phosphate linkage on the 
oligosaccharides have not been identified. The O-linked 
saccharides arB generally short (< 5 residues) and contain 
only <x-l,2-!inked mannose units. Attempts to engineer 
alterations in glycosylation processing are in early devel- 
opment but may lead to glycoproteins similar to their 
mammalian counterparts in the future. 
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